rank | frequency | n-gram |
---|---|---|
1 | 8348 | -ת |
2 | 7408 | -ה |
3 | 7001 | -ם |
4 | 3681 | -ו |
5 | 3528 | -י |
rank | frequency | n-gram |
---|---|---|
1 | 5332 | -ים |
2 | 4243 | -ות |
3 | 1473 | -ית |
4 | 1238 | -יה |
5 | 933 | -ון |
rank | frequency | n-gram |
---|---|---|
1 | 909 | -יות |
2 | 792 | -יים |
3 | 728 | -רים |
4 | 614 | -נים |
5 | 495 | -רות |
rank | frequency | n-gram |
---|---|---|
1 | 206 | -ונים |
2 | 178 | -ורים |
3 | 165 | -ונות |
4 | 138 | -ניות |
5 | 134 | -ניים |
rank | frequency | n-gram |
---|---|---|
1 | 53 | -ותיהם |
2 | 48 | -לוגיה |
3 | 46 | -וניים |
4 | 45 | -וניות |
5 | 36 | -גרפיה |
The tables show the most frequent letter-N-grams at the ending of words for N=1…5. Everything runs in parallel to 2.2.5 Most frequent word beginnings. The aim is suffix detection instead of affix detection.
For N=3:
SELECT @pos:=(@pos+1), xx.* from (SELECT @pos:=0) r, (select count(*) as cnt ,concat("-", right(word,3)) FROM words WHERE w_id>100 group by right(word,3) order by cnt desc) xx limit 5;
2.2.5 Most frequent word beginnings